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To date, the best classical algorithm for performing error correction in the surface code has been 
minimum-weight perfect matching. However, in this work we present a Markov chain Monte Carlo 
algorithm that achieves significantly lower logical error rates. It therefore allows any target logical 
error rate to be obtained using a significantly smaller code. This increase in performance does come 
at the cost of an increased runtime complexity, but only by a polynomial factor O(L^) for e < 2. Our 
algorithm is based on an analytically exact rewriting of the probability of each logical equivalence 
class, which also suggests that for arbitrary stabilizer codes error correction can be performed to 
arbitrary accuracy in a runtime 0(poly(L)). It is applicable to any stabilizer code, allows for 
parallelization, and can be used to correct in the case of imperfect stabilizer measurements. 



INTRODUCTION 

An important primitive for the processing of quantum 
information is the ability to store it despite constant cor- 
ruptive influence of the external environment on the ap- 
plied hardware and imperfections of the latter. While 
one approach seeks to achieve this by constructing a self- 
correcting quantum memory (see Ref. [1^ for a recent re- 
view) , an alternative possibility is to dynamically protect 
the stored quantum information by constantly pumping 
entropy out of the system. Topological quantum error 
correction codes [2l |3 store one logical qubit in a large 
number of physical qubits, in a way which guarantees 
that a sufficiently low density of errors on the physical 
qubits can be detected and undone, without affecting the 
stored logical qubit. Most promising is the surface code 
[IHl], which requires only local four-qubit parity opera- 
tors to be measured. While proposals for direct measure- 
ment of such operators exist jSHTOj , most of the literature 
focuses on time- dependent interactions between the four 
qubits and an auxiliary qubit, allowing to perform se- 
quential CNOT gates and to finally read the measure- 
ment result off the auxiliary qubit. See Ref. [TP for a 
recent review. 

In order to decode the syndrome information, i.e. use 
the outcomes of all four-qubit measurements to find out 
how to optimally perform error correction, a classical 
computation is necessary. This classical computation is 
not trivial and brute force approaches are infeasible. De- 
coding algorithms based on renormalization techniques 
[12] or minimum- weight perfect matching (MWPM) [13] 
have a runtime complexity 0{L'^) and can be parallelized 
to 0{L^) (neglecting logarithms), where L is the linear 
size of the code. As these algorithms are approximative, 
the logical error rates achievable with them fall short 
of those theoretically achievable by brute force decod- 
ing. A Markov chain Monte Carlo (MCMC) algorithm 
[14 can cope with higher physical error rates than the 
two mentioned algorithms, but has super-polynomial (yet 
sub-exponential) runtime complexity. In this work, we 
present an efficient MCMC decoding algorithm that al- 



lows to achieve logical error rates lower than those achiev- 
able by means of MWPM [15 . Equivalently, a smaller 
code size is required to achieve a certain target logi- 
cal error rate. Our algorithm allows for trade-offs be- 
tween runtime and achieved logical error rate. We define 
its runtime to be the minimal computation time such 
that the achieved logical error rate is lower than the one 
achievable by means of MWPM. Empirically, we find a 
runtime complexity 0{L^) with a < 4 depending on 
the error model. The runtime complexity can be par- 
allelized to 0(L"~^), thus adding complexity 0(L"~^) 
to MWPM and renormalization technique decoding al- 
gorithms. In summary, in comparison to alternative al- 
gorithms [m [13 our algorithm allows for lower quantum 
information error rates and smaller code sizes at the cost 
of a higher classical runtime complexity. Given the cur- 
rent state of the art of quantum and classical information 
processing, shifting requirements from quantum to clas- 
sical seems desirable. Our algorithm is generalizable to 
the (realistic) case of imperfect stabilizer measurements, 
though we restrict numerical simulations in this work to 
the case of perfect measurements for simplicity. 



ERROR CORRECTION IN SURFACE CODES 

Stabilizer operators are, in the context of the surface 
code, tensor products of or operators (see Fig. [T]) 
which are required to yield a +1 eigenvalue when applied 
to the quantum state stored in the code. Eigenvalues —1 
are treated as errors and interpreted as the presence of 
an any on. A surface code of size L has ngtab = 2L(L — 1) 
(3- and 4-qubit) stabilizers. Since all stabilizers com- 
mute, they can be measured simultaneously and hence 
the presence of anyons can be detected. Any Pauli op- 
erator cr^, cr^, or applied to a data qubit creates at 
least one anyon as it anti-commutes with at least one 
stabilizer. We call violated a^-stabilizers s-anyons and 
violated <j^-stabilizers p-anyons. 

Given some anyon configuration A, the goal is to ap- 
ply a series of single-qubit and operators, such 
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FIG. 1. An L = 4 surface code. Black dots are data qubits, 
grey dots are syndrome qubits that allow to read off the re- 
sults of the stabilizer measurements when sequential CNOT 
gates have been performed between them and the adjacent 
data qubits. Stabilizer operators are either tensor products 
of operators (acting on the data qubits around a white 
square/triangle) or tensor products of operators (acting 
on the data qubits around a blue square/triangle). 



that all anyons are removed and a trivial operation has 
been performed on the code subspace. Two such hy- 
potheses about what errors the physical qubits have suf- 
fered are equivalent if they can be deformed into each 
other through the application of stabilizers. Equivalent 
error chains will lead to the same operation performed 
on the code subspace (consisting of the states which are 
+1 eigenstates of each stabilizer). For the surface code, 
there are four such equivalence classes. The goal is there- 
fore to find the most probable equivalence class of error 
chains and not to find the most likely error chain. The 
most likely error chain need not be an element of the 
most likely equivalence class, though trying to correct by 
undoing the most likely error path is a reasonable approx- 
imation and is the idea behind minimum-weight perfect 
matching correction algorithms. More precisely, MWPM 
matches both kinds of anyons independently of each other 
and thus ignores potential correlations between <j^- and 
cr^-errors. 

Decoherence models in which each qubit independently 
is subject to the channel 
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(with Pi ^ Px ^ Py ^ Pz = 1) allow for efficient simu- 
lation on a classical computer. While physical decoher- 
ence models may not exactly have the form of Eq. ([T]), 
they may be approximated by such a channel through 
a PauU twirl approximation [161 IIZ] • The two most fre- 
quently studied noise models of the form of Eq. ([T]) are 
independent hit- and phase-flip errors {px = ^5(1 — Pp)^ 



Pz = — Pb) and Py = pbPp for independent bit- and 
phase-flip probabilities and Pp) and depolarizing noise 
{Px = Pz = Py = f)- The theoretical maximal error 
rates up to which error correction is possible by exact 
error correction are known to be Pb^Pp < 10.9% for in- 
dependent bit- and phase- flip errors [5] and p < 18.9% 
for depolarizing noise [18 . Any approximate error correc- 
tion algorithm will yield threshold error rates below these 
theoretical maxima. For error rates below the threshold 
values, the probability of a logical error decreases expo- 
nentially with L. 

Our algorithm is based on an analytically exact rewrit- 
ing of the probability of each equivalence class which al- 
lows evaluation with the Metropolis algorithm. Let us 
discuss depolarizing noise here and note that our dis- 
cussion generalizes straightforwardly to arbitrary error 
models of the form of Eq. ([T]). We have defined a de- 
polarization rate p to mean that each spin has suffered a 
(j^ ^ (jy ^ or (j^ error with probability p/3 each and no error 
with probability 1 — p. Consequently, the probability of 
an error chain involving n single-qubit errors is up to a 

normalization constant given by (^^z^^ = where 

P is defined through 
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Given an anyon configuration A, the relative probability 
of equivalence class E can be written as 
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where the sum runs over all error chains that are com- 
patible with the anyon configuration A and elements of 
equivalence class and n denotes the number of single- 
qubit errors in a particular error chain. The goal is to 
find the equivalence class E with maximal Ze{P)' 

The Metropolis algorithm allows us to approximate ex- 
pressions of the form 
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(we use P to denote a generic "inverse temperature" and 
P to denote the specific one defined through Eq. ([2|). 
The sum is here over all error configurations in equiv- 
alence class E that are compatible with the syndrome 
information A. In order to approximate an expression of 
the form in Eq. Q, we pick one stabilizer at random and 
calculate the number An by which the total number of 
errors n in the code would change if that stabilizer were 
applied. If An < 0, we apply the stabilizer and if An > 
we apply it with probability e~^^^. Summing up /(n) 
over all steps and dividing by the total number of steps 
then yields our approximation to Eq. Q. 

Deforming error patterns only through the applica- 
tion of stabilizers ensures that all error patterns in one 



Markov chain belong to the same class, and that all of 
them are compatible with the same anyon configuration 
A. Since we will need the average {f{n))^ ^ for each 
equivalence class we need an initial error configuration 
from each equivalence class which is compatible with the 
measured anyon syndrome A. In fact, we will start the 
Metropolis Markov chains with the minimum weight er- 
ror configuration from each equivalence class. The reason 
for starting with the minimum weight error configuration 
rather than a random initial configuration from the same 
equivalence class is based on the intuition that "heating 
up" from the groundstate to inverse temperatures /3 as 
needed for the equilibrium averages in Eq. Q takes less 
time than "cooling down" from a high energy configura- 
tion. In order to find the minimum weight hypotheses 
from each equivalence class, first note that each hypoth- 
esis about what errors have happened has to link each 
p-anyon with a chain of errors to another p-anyon or 
to the corresponding type of boundary (left or right in 
Fig. [T]), and similarly with errors for s-anyons (which 
can be linked to the top or bottom boundary). A qubit 
which has suffered a cr^ error is then simply one on which 
a and a error have occurred. 

For a graph with weighted edges between an even num- 
ber of vertices, Edmond's MWPM algorithm |20^ finds 
the pairing of minimal weight efficiently. We employ 
the library Blossom V [21] to perform MWPM. For the 
graphs which are relevant for the surface code, MWPM 
can be performed in runtime complexity 0{L'^) and can 
be parallelized to 0{L^) [13]. In order to obtain such 
graphs, we assign to the edge between two anyons (cor- 
responding to the vertices) the minimal number of single- 
qubit errors necessary to link them (their Manhattan 
distance). For each anyon, we place a virtual partner 
on the closer boundary of the type which is able to ab- 
sorb it (top an bottom for s-anyons and left and right 
for p-anyons) . We then add an edge between each anyon 
and its virtual partner (with weight again given by the 
Manhattan distance) and zero-weight edges between all 
virtual anyons on the same boundary. Including these 
virtual anyons ensures that the number of vertices in the 
graph is even and that each anyon can be matched to 
the closest absorbing boundary. The zero-weight edges 
ensure that unnecessary virtual anyons can be removed 
at no cost. The four equivalence classes of errors in the 
surface code may be identified by determining the parity 
of the number of errors that lie on a given line that links 
the top and bottom or left and right boundary. In order 
to find the minimum weight error configuration of all four 
equivalence classes, we need be able to enforce changes 
in these parities. This can be achieved by adding one 
further virtual anyon on the top and bottom or the left 
and right boundary, respectively, and connecting it with 
zero-weight edges to all virtual anyons already present 
at this boundary [19j. It is thus guaranteed that a any 
pairing of all real and virtual anyons of the same type is 
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FIG. 2. Three p-anyons (solid squares) have been detected in 
a surface code. In order to find hypotheses of minimum weight 
(maximal probability) about what errors have occurred, we 
first add virtual anyons (light squares) on the closest absorb- 
ing boundary of each real anyon and connect virtual anyons 
residing on the same boundary; see the a) part of the figure. 
Dashed lines represent zero- weight edges, solid lines represent 
non-zero- weight edges. In part b) of the figure, we place an 
additional virtual anyon on the left and right boundary. Note 
that each possible pairing in part b) is an element of a differ- 
ent equivalence class then the pairings which are possible in 
part a). 



an element of a different equivalence class than the one 
obtained if these two additional virtual anyons had not 
been included. See Fig. [2] for an illustration. 

Note that in Eq. Q has 2^^*^^ summands for each 
equivalence class so knowing an averaged sum is as 
good as knowing the whole sum. We have 
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corresponding to a simple Monte Carlo sampling of the 
sum. However, the sum is dominated by an exponen- 
tially small fraction of summands with "energy" n close 
to the minimal value, so Monte Carlo sampling is com- 
putationally similarly expensive as a brute force calcu- 
lation of the sum. Our goal is thus to rewrite Ze{P) in 
a way that involves only quantities which are evaluable 
efficiently with the Metropolis algorithm. Applying the 
fundamental theorem of calculus we have 

log Ze W) = / dpd^ log Ze W) + log ZeW = 0) 
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If we know the functions (n)^ the most likely equiv- 
alence class is, according to Eq. (|6|, the one in which 
the area under the curve is smallest. In the correctable 
regime {p < Pc) the differences in "free energy" 
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between the different equivalence classes grow propor- 
tionally in L and correspondingly the probability of all 
equivalence classes but the most likely one decreases ex- 
ponentially with L. However, as the values of the inte- 
grand (n)^ ^ grow with L^, the relative differences in free 
energy decrease with L. 

For a positive /3, the average number of errors (n)^ ^ 
can be efficiently calculated to arbitrary accuracy_ by 

means of the Metropolis algorithm. The integral d/3 
can be calculated to arbitrary accuracy by first calculat- 
ing the values (n)^ ^ for a sufficient number of inverse 
temperatures j3 and then applying a quadrature formula 
like Simpson's Rule. We conclude that the free energy 
Fe{P) and hence the probability of each equivalence class 
can be efficiently calculated to arbitrary accuracy - in 
contrast to the idea that error correction algorithms are 
either inefficient or suboptimal. 



THE SINGLE TEMPERATURE ALGORITHM 

Recall that we are not interested in the precise value 
of the integrals d(3 (n) ^ but only in knowing for 
which equivalence class E this integral is smallest. For 
this reason, calculating the whole integral is quite often 
an overkill. In fact, most of the relevant information con- 
tained in the function (n)^ ^ can be extracted by finding 
its value for a single inverse temperature 

Assume that we determine the values (n)^* ^ for some 
/3* > for all equivalence classes E. If the functions 
(n)^ ^ for the different equivalence classes do not cross, 
knowing the values (n)^* ^ is as good as knowing the 

whole integrals d/3 (n)^ ^ for deciding for which equiv- 
alence class E the integral is smallest. 

As /3 ^ 0, each qubit is affected by an x-, y-, or z- 
error or no error at all with probability ^, so {n)^ ^ 

l^qubits, where nqubits = ^stab + 1 is the number of data 
qubits in the code. The low-/3 tail of the function (n)^ ^ 
thus contains almost no information about the equiva- 
lence class E. So while the integral d/3 (n)^ ^ is dom- 
inated by its low-/3 part ((n)^ ^ is a monotonically de- 
creasing function of the differences between these in- 
tegrals for the different equivalence classes are mainly 
due to their high-/3 part. So even if there are crossings in 
the low-/3 tails of the functions (n)^ ^, basing the deci- 
sion for the most likely equivalence class on a single value 
(^)/3* E likely to yield the same outcome as basing the 

decision on the whole integral dP (n) ^ ^. We thus 
define our single-temperature algorithm as sampling the 
values (n)^* ^ for all equivalence classes and performing 
error correction in accordance with the equivalence class 
E for which this value is smallest. 

This algorithm has only two free parameters, namely 
/3* and the number of steps for which we perform the 



Metropolis algorithm in order to sample (n)^* ^. Note 
that for /3* ^ the single-temperature algorithm be- 
comes useless ((n)^*^Q^ = fnqubits for all equivalence 
classes) and for /3* ^ oo the single-temperature algo- 
rithm coincides with MWPM as it bases its decision on 
the minimum- weight error chain of each quivalence class. 
So if the single-temperature algorithm is able to outper- 
form MWPM, it has to be based on /3* ^ p. Empiri- 
cally, we find that in the case of depolarizing noise chos- 
ing a value for /3* different from P does not offer much 
room for improvement, so we set = P for this er- 
ror model. As for the second parameter, we may ask 
how many Metropolis steps are necessary for our single- 
temperature algorithm to achieve logical error rates be- 
low those achievable with MWPM. 

In order to set the bar high, we compare our algorithm 
with a somewhat enhanced version of MWPM. This uses 
our above described method of enforcing changes in the 
equivalence class for the MWPM algorithm, thus allow- 
ing us to find two non-equivalent minimum-weight er- 
ror chains for both kinds of anyons. Combining these 
2x2 error chains gives four hypotheses about the errors 
that have happened. Each is the most likely within its 
equivalence class for an approximate error model where 
the correlations between x- and z-errors are ignored (the 
only one MWPM can treat straightforwardly). We can 
then determine which of the four is most likely overall ac- 
cording to the true depolarizing noise model - i.e., deter- 
mine the hypotheses with the minimal number of errors, 
where one x- and one z-error on the same qubit count as 
one y-eiTOT. As we show below, this enhanced version of 
MWPM indeed leads to lower logical error rates as stan- 
dard MWPM (as performed for example in Ref. [22 ), as 
it does not completely ignore correlations between x- and 
z-errors [23 . 

Fig. |3] shows the number of Metropolis steps which are 
necessary for our single-temperature algorithm to achieve 
the logical error rates achievable with enhanced MWPM, 
and illustrates that already an 0{L^) runtime is suffi- 
cient for this. The inset in Fig. [S] shows the logical error 
rates achievable with our single-temperature algorithm if 
the quantities (n)^* ^ are sampled over lOL^ Metropolis 
steps. For values of p below 15%, the logical error rate de- 
creases exponentially with L, meaning that the threshold 
error rate for our algorithm is significantly below the the- 
oretical maximum of 18.9% [18 and the value of 18.5% 
achieved in Ref. [14 but closer to threshold of MWPM 
[22] , However, the relevant figures of merit in practice 
are the logical error rates achievable well below threshold 
(where our algorithm offers significant improvement over 
MWPM, see below) and the runtime complexity (where 
our algorithm offers significant improvement over the al- 
gorithm of Ref. [14 ) . 

The quantity which determines whether error correc- 
tion will be successful is the difference min (n)^* ^^-^^^ — 
(^)/3*,true' where min (n)^* f^^^g^ denotes the minimal av- 
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FIG. 3. The plot shows nsampie/i>^ against L for the case of 
depolarizing noise with p — 14%. Here risampie is the mini- 
mum number of Metropolis steps over which the quantities 
(^)/3* E (with /3* = ^) must be sampled for each equiv- 
alence class in order for the logical error rate of the 
single temperature algorithm to be below that of enhanced 
MWPM. Each data point (shown in blue) is averaged over 
as many error configurations as were required for 2 000 log- 
ical errors to occur. The ratio nsampie/^"^ clearly does not 
diverge. Indeed, the agreement with the function 0.93L~^ °° 
(purple line) shows that it seems set to vanish as L ^ oo. 
This shows that the complexity of risampie ^ O(L^), and ap- 
pears to be as low as risampie ^ O(L^). The inset shows 
the logical error rates of our single-temperature algorithm 
alone with risampie = lOL^, for various system sizes L (hori- 
zontal axis) and depolarization rates p (from bottom to top 
p = 13%, 14%, 15%, 15.5%, 16%, 17%). 

eraged number of errors of all three false equivalence 
classes. This difference is displayed for various values 
of p and L in Fig. [4j For p < 16% this difference in- 
creases linearly with L, while for p = 17% it becomes 
even negative for large enough L. This is to be expected: 
for an error rate sufficiently close to the 18.9% thresh- 
old, each of the averages (n)^* ^ for the four equivalence 
classes E has the same probability for being the smallest 
one, such that the probability that one of the three false 
equivalence classes becomes minimal approaches |. 

Fig. [5] compares the logical error rates achievable with 
different algorithms for L = 15 and different depolariza- 
tion rates p. We set the logical error rates achievable 
with standard MWPM to unity and divide them by the 
logical error rates achievable with alternate algorithms. 
The algorithms which are displayed are: 

A) Standard, unimproved MPWM, as employed in 
Ref. [22 . An x- and a z-error on the same qubit 
count as two errors. 

B) Enhanced MWPM. An x- and a z-error on the same 
qubit count as two errors during the matching, but 
as one error during a final comparison of all equiv- 
alence classes. 

C) Single-temperature algorithm with quantities 




FIG. 4. The plot shows the distinguishabilities 

min (n)^*^f^^g^ - (n)^,^^^^^ (vertical axis; we use /3* = ^) 
for various depolarization rates p and code sizes L (hor- 
izontal axis). Different lines correspond to different 
depolarization rates p. From top to bottom we have 
p = 13%, 14%, 15%, 15.5%, 16%, 17%. All quantities (n)^,^^ 
are sampled over lOL^ Metropolis steps and each data point 
is averaged over as many error configurations as are necessary 
to obtain 2 000 logical errors. 

(^)/3* E sampled over lOL^ Metropolis steps. 

D) Calculation of the entire integral d/3 (n)^ ^ 
through first sampling (n)^ ^ over lOL^ Metropolis 

steps for 21 equidistant values of /3 (0, . . . , /^) 
and then applying Simpson's quadrature formula. 

E) Single-temperature algorithm with quantities 
(n)^^ ^ sampled over lOOL^ Metropolis steps. 

F) The parallel-tempering algorithm developped in 
Ref. pij. 

The logical error rates decrease from A to F, while the 
runtime complexities increase (with the exception of al- 
gorithm E requiring less time than algorithm D). We see 
that the advantages achievable over algorithm A van- 
ish as p ^ = 18.9% but increase the lower p gets. 
Note that algorithm D shows only a marginal advantage 
over algorithm C, as expected. We believe our single- 
temperature algorithm (C and E) to offer the most at- 
tractive trade-off ratio between low logical error rates and 
low classical runtime complexity, as they increase the lat- 
ter only modestly compared with algorithms A and B, 
while algorithm F has a super-polynomial (in L) runtime 
complexity. Algorithm D is strictly dominated by algo- 
rithm E (lower runtime and lower logical error rate). 

Fig. |6] shows the logical error rates of algorithm A di- 
vided by those of algorithm C for various values of p and 
L. We see the advantage of algorithm C increasing for 
lower p and larger L. Note that we expect real quantum 
computers to be operated at error rates p below and code 
distances L above those displayed in Fig. [6j 
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FIG. 5. The plots show the logical error rate of MWPM 
(algorithm A in the main text) divided by the logical error 
rate of algorithms A to F described in the main text, for 
various depolarization rates p (horizontal axis) and a code 
of linear size L = 15. Each data point is averaged over as 
many error configurations as are necessary to obtain 2 000 
logical errors. A value greater than 1 denotes an increase in 
effectiveness over MWPM, with greater increases for higher 
values. 
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FIG. 6. The plots show the logical error rate of algorithm 
A divided by the logical error rate of algorithm C for various 
depolarization rates p (horizontal axis) and code sizes L. Each 
data point is averaged over as many error configurations as 
are necessary to obtain 2 000 logical errors. 



To get an idea about what effect this reduction of the 
logical error rates has on the necessary code size, let us 
have a look at the code size which is needed for a proof 
of principle experiment. I.e., given some physical error 
rate p, which code size L is needed to bring the logical 
error rate below the physical one? For p = 13%, we need 
L > 6 in order to achieve a logical error rate below p 
with algorithm C and need L > 15 with algorithm A. So 
by modestly enhancing the classical runtime complexity, 
we are able to reduce the number of physical data qubits 
required for such a proof of principle experiment from 
421 to 61. 

The advantage of algorithm C over algorithms A and 



B is due to at least two different reasons. First, MWPM 
is naturally suited to the error model of independent bit- 
and phase-flips, where each type of anyon can be treated 
independent of the other. It is much less suited to error 
models such as depolarizing noise which feature correla- 
tions between bit- and phase-flip errors. Algorithm B can 
only partially overcome these limitations. Second, while 
algorithms A and B are based on finding the most likely 
error chain and hoping that it is an element of the most 
likely equivalence class, algorithms C to F are based on 
finding the most likely equivalence class. An interesting 
question is thus how our single-temperature algorithm 
compares with MWPM for independent bit- and phase- 
flip errors, the error model MWPM is best suited to and 
where only the second advantage applies. Empirically, 
we find that for this error model chosing /3* = O.SbP 
works best and that for a bit-/phase-flip probability of 
10% (close to the theoretical threshold) 0{L'^) Metropo- 
lis steps are again sufficient to achieve a logical error rate 
below the one of MWPM, with 0.38L^ '^^ giving the best 
fit to the required number. For L = 30, a bit- /phase-flip 
probability of 8%, and by sampling over lOL^ Metropo- 
lis steps we achieve a logical error rate which is a fac- 
tor 1.3 lower than the one of MWPM. So significant im- 
provements over MWPM can be achieved even for the 
error model best suited to it, though the advantage is 
much more drastic for an error model with correlations 
between bit- and phase-flip errors, with which our single- 
temperature algorithm can deal very naturally. 

Also note that there are quantum error correction 
codes such as the toric code with higher-than-two- 
dimensional qudits to which MWPM is not applicable. 
In contrast, the algorithm presented here is readily gen- 
eralizable to any kind of stabilizer code. 



PARALLELIZATION 

The runtime of our algorithm can be reduced by a fac- 
tor 0{L'^) using parallelization which exploits the fact 
that our algorithm needs local changes only. We partition 
the whole code into 0{L'^) rectangles of area 0{L^). Ad- 
jacent rectangles overlap along lines of data qubits, while 
qubits in the corners belong to four rectangles, see Fig.[7| 
The rectangles are collected into four groups 0, 1, 2, 3 such 
that the rectangles within one group have no overlap- 
ping qubits. At step i of the Metropolis Markov chain, 
we choose one stabilizer in each rectangle belonging to 
group {i mod 4) at random and probe whether to apply 
it or not according to the Metropolis procedure. This 
way we can guarantee that no data qubit is affected by 
more than one applied stabilizer at each step. If a ran- 
domly chosen stabilizer is applied and flips a qubit which 
is shared with an other rectangle (other rectangles), this 
flip is communicated to the adjacent rect angle (-s). For 
each rectangle, we add up the number of local errors n 
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FIG. 7. The dashed hnes partition the code into rectangles 
of size 0(1). Data qubits along the boundaries belong to two 
rectangles and data qubits in the corners to four. 



over the different Metropolis steps and calculate the total 
average (n)^^ ^ in the end. To compensate for double- 
(quadruple-) counting, errors on qubits along the bound- 
ary thereby have to be discounted by a factor ^ and errors 
on qubits in the corners by a factor ^. As we probe now 
O(L^) stabilizers in each time step, the runtime reduces 
from 0(L") to 0(L"-2). 



FIG. 8. Only one type of errors and stabilizers and a sin- 
gle chain of data qubits (black dots) and syndrome qubits 
(grey dots) are depicted here for simplicity. Time runs from 
bottom to top. Stabilizer operators are measured at times 
...,t — l,t,t+l, — The two encircled syndrome qubits have 
detected errors ( — 1 eigenvalues). A possible hypothesis is 
that no data qubit has suffered an error and both syndrome 
measurements have been erroneous. An alternate hypotheses 
would state that the data qubits indicated by red lightning 
bolts have suffered errors and that syndrome measurement by 
use of the syndrome qubits indicated by red lightning bolts 
has been erroneous. Different equivalent hypotheses can be 
deformed into each other through the application of operators 
like the red square that invert whether a hypothetical error 
has happened or not at two adjacent syndrome qubits and 
one data qubit at two subsequent times. 



IMPERFECT STABILIZER MEASUREMENTS 

Let us assume that each stabilizer measurement yields 
the wrong result with probability pm- If stabilizers are 
measured by use of CNOT gates, this model is a simpli- 
fication in that it ignores correlations between physical 
qubit errors (induced by the channel Eq. ([T])) and mea- 
surement errors [24 . In order to make the failure proba- 
bility small despite non- negligible pm, we now necessarily 
need to perform stabilizer measurements at several times 
t = 1,2,..., tmax- A hypothesis about what errors have 
happened then not only has to state which data qubit 
has suffered an error in which time interval [t, t + but 
also which stabilizer measurement has been erroneous at 
which time t. Such hypotheses can be deformed into 
equivalent ones by applying a bit-/phase-flip a^/cr^ to a 
particular qubit at time intervals [t — l^t] and [t^t + 1] 
and inverting the hypothesis about whether the stabilizer 
measurements at time t that anti-commute with this er- 
ror have been erroneous (see the illustration in Fig. [8|. 

In the case of depolarizing noise, a hypothesis that 
involves n data-qubit errors and m erroneous syndrome 



measurements has a relative probability 



1-p 



: exp [-P{n + ^ 



where f3 is as defined in Eq. ^ and 



P PM 



Pm 



(8) 



(9) 



The "energy" of a hypothesis is thus given by n + 
where ^ determines the relative weight of erroneous syn- 
drome measurements to data qubit errors. Our method 
to find the most probable equivalence class in the case of 
perfect stabilizer measurements can thus be generalized 
to the case where syndrome measurements fail with a 
considerable probability. Numerical results for the latter 
case will appear in future work. 



CONCLUSIONS 

We have shown that adding runtime complexity O(L^), 
with £ < 2, to the fastest surface code error correction 
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algorithms [T2j [13] allows us to attain logical error rates 
significantly below those achievable with MWPM. More 
precisely, it appears that e G [1.00,1.77] is sufficient for 
error models which are mixtures of depolarizing noise 
and independent bit- and phase-flip errors. This means 
that in order to reach a certain target logical error rate, 
the surface code may be significantly smaller if our algo- 
rithm is applied. While for error rates close to the thresh- 
old, where error correction is almost infeasible, both al- 
gorithms perform similarly, the advantage of our single- 
temperature algorithm over MWPM increases as error 
rates are decreased (and code sizes are increased). This 
makes it particularly relevant in practice, as quantum 
computers will be operated at error rates significantly 
below threshold, and the ability to use smaller code sizes 
will have a significant practical advantage. 
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